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INTRODUCTION 
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Tumor  suppressor  genes  are  defined  as  elements  whose  loss  or  inactivation 
promotes  neoplastic  transformation  (1 ,2).  Mutations  in  tumor  suppressor  genes  play  a 
key  role  in  both  genetic  predisposition  of  different  forms  of  cancer  and  in  the  etiology  of 
spontaneous  malignancies.  The  goal  of  our  research  was  the  isolation  and  analysis  of 
tumor  suppressor  genes  whose  lost  or  inactivation  is  involved  in  the  formation  or 
progression  of  breast  cancer.  To  identify  new  tumor  suppressor  genes,  we  are  using  a 
recently  developed  genetic  suppressor  element  (GSE)  approach.  This  strategy  is  based 
on  the  expression  selection  of  transforming  GSEs,  which  promote  transformation 
presumably  by  suppressing  tumor  suppressor  genes  from  which  they  derived  (3,4). 

To  isolate  breast  cancer  tumor  suppressor  genes,  we  decided  to  screen  GSE 
library  specific  for  normal  breast  cells  (4).  This  library  is  enriched  for  sequences  derived 
from  putative  tumor  suppressor  genes,  which  is  increasing  the  possibility  of  their 
isolation  in  different  selection  procedures.  Previous  screening  of  this  library  showed  that 
it  contains  biologically  active  GSEs,  since  selection  for  the  tumor  formation  in  nude  mice 
led  to  the  isolation  of  several  transforming  elements.  The  strongest  of  the  transforming 
GSEs  was  used  for  cloning  its  corresponding  gene,  named  ING1  (4).  Expression  of 
ING1  is  upregulated  in  senescent  human  fibroblasts  (5),  while  ectopic  expression  of 
ING1  cDNA  leads  to  arrest  in  the  G1  phase  of  the  cell  cycle  or  induces  apoptosis  in 
several  cell  types  (6).  ING1  encodes  nuclear  protein  p33ING1  that  cooperates  with  p53  in 
growth  regulation  by  modulating  the  ability  of  p53  to  act  as  a  transcriptional  activator  (7). 
Limited  analysis  of  tumor  cell  lines  revealed  reduced  expression  of  ING1  in  several 
breast  carcinoma  cell  lines  (4),  which  indicates  its  involvement  in  the  development  of 
breast  cancer.  ING1  is  mapped  to  human  chromosome  13q34  (8),  and  loss  of 
heterozygosity  in  this  chromosomal  locus  has  been  reported  in  squamous-cell 
carcinomas  of  head  and  neck.  These  properties  of  ING1  allowed  defining  it  as  a 
candidate  tumor  suppressor  gene,  with  high  probability  of  involvement  in  the  breast 
cancer  formation  or  progression. 

Goal  of  my  research  was  the  isolation  of  the  new  transforming  GSEs  from  the 
GSE  library  specific  for  normal  breast  cells,  as  a  first  step  towards  the  isolation  of  the 
new  breast  tumor  suppressor  genes.  Since  the  screening  of  the  GSE  library  is  a  long 
process,  in  parallel,  I  decided  to  study  the  ING1  candidate  tumor  suppressor  gene,  since 
there  is  strong  indications  about  its  involvement  in  the  formation  of  breast  cancer.  All  the 
information  regarding  the  p33ING1  function  was  obtained  so  far  only  through  in  vitro 
experiments.  However,  in  order  to  really  understand  a  role  that  a  certain  gene  might 
have,  it  is  necessary  to  analyze  its  expression,  regulation,  and  function  in  the  living 
organism,  to  see  its  involvement  in  development  and  other  physiological  processes  that 
can  not  be  analyzed  on  isolated  tissue  culture  cells.  To  do  these  kind  of  studies,  it  is 
necessary  to  have  a  model  system  that  will  allow  easy  and  efficient  genetic  manipulation 
with  the  gene  of  interest.  In  order  to  do  detailed  analysis  of  ING1,  I  decided  to  use  a 
mouse  model,  which  is  the  ideal  system  for  different  kinds  of  genetic  studies.  As  a  first 
and  essential  step  towards  understanding  its  function  and  involvement  in  physiological 
processes  as  well  as  carcinogenesis,  I  analyzed  the  structure  and  regulation  of  the 
mouse  ingl  gene. 
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Cloning  and  structural  analysis  of  the  mouse  ING1  homologue  was  designed  as  a 
first  step  in  the  generation  of  ingl  knockout  animals.  Those  animals  would  provide 
valuable  data  on  the  role  of  the  gene  in  normal  development,  but  they  could  also  give 
important  insight  into  the  role  that  ING1  might  have  in  carcinogenesis.  The  involvement 
of  ING1  in  the  p53  signaling  pathway  suggests  that  it  could  be  a  tumor  suppressor  by 
itself,  in  which  case  ingl  knockout  may  lead  to  the  consequences  similar  to  those 
associated  with  p53  deficiency  in  animals. 


BODY 


In  order  to  isolate  GSEs  for  the  breast  cancer  tumor  suppressor  genes,  I  used 
GSE  library  prepared  from  cDNA  specific  for  normal  human  breast  epithelial  cells 
mentioned  in  introduction  (4).  This  library,  cloned  into  pLNCX  retroviral  vector,  is 
enriched  for  the  sequences  derived  from  tumor  suppressor  genes.  Previous  experiments 
showed  that  it  contain  biologically  active  GSEs,  which  is  making  it  a  good  source  for  the 
isolation  of  genes  whose  inactivation  is  involved  in  breast  cancer  development.  As  test 
cells  I  used  MCF10A  (9),  human  pseudonormal  immortalized  breast  cell  line  with  low 
efficiency  of  growth  in  semi-solid  media,  low  background  in  spontaneous  focus  formation 
and  non-tumorigenic  for  nude  mice  (unpublished  observation).  Library  was  delivered  to 
test  cells  using  BING  packaging  cell  line  (10)  for  virus  preparation.  Infected  cells  were 
used  in  three  different  selection  procedures:  selection  for  foci  formation,  selection  for 
colony  formation  in  semi-solid  agar,  and  selection  for  tumorigenicity  in  nude  mice.  In 
order  to  isolate  biologically  active  GSEs  several  rounds  of  selection  should  be 
performed,  and  this  work  is  still  in  progress. 

In  parallel,  I  studied  the  ING1  candidate  tumor  suppressor  gene,  since  there  is 
strong  indications  about  its  involvement  in  the  formation  of  breast  cancer.  In  order  to  do 
detailed  analysis  of  ING1 ,  I  used  a  mouse  model,  which  is  the  ideal  system  for  different 
kinds  of  genetic  studies.  The  first  step  in  this  approach  was  the  isolation  of  the  mouse 
ING1  homologue.  In  the  previous  year,  I  isolated  mouse  ingl,  determined  its  structure, 
including  precise  mapping  of  the  beginning  and  end  points  of  ingl  transcription  as  well 
as  exon-intron  structure,  and  analyzed  its  expression  and  regulation. 

Mouse  ingl  is  transcribed  from  three  differently  regulated  promoters  localized 
within  a  4  kb  segment  of  genomic  DNA.  Two  of  them  are  TATA-less  promoters  with  a 
loose  initiator  sequence  that  includes  a  few  tightly  clustered  transcription  start  sites  and 
multiple  Spl  binding  sites  upstream  from  the  initiator.  The  third  promoter  has  TATA-like 
box,  putative  CAAT  box  and  multiple  Spl  binding  sites.  Three  alternative  transcripts  of 
ingl  share  a  long  common  region  encoded  by  a  common  exon  and  differ  in  their  5’-exon 
sequences  encoded  by  alternative  5’-exons.  Only  one  of  the  5’-end  exons  contains  a 
protein-coding  sequence,  while  translation  of  the  two  other  transcripts  starts  from  the 
initiator  codon  within  the  common  region.  Three  alternative  transcripts  of  ingl  encode 
two  proteins  (p37in91  and  p31 ,ng1 ),  one  of  which  is  a  truncated  version  of  the  other  lacking 
104  N-terminal  amino  acids.  Protein  sequences  of  mouse  and  human  ingl  genes  show 
a  high  degree  of  similarity,  especially  in  their  C-terminal  regions  that  contain  a  PHD 
finger  domain.  At  the  nucleotide  level,  sequence  similarity  between  human  and  mouse 
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cDNAs  was  high  throughout  the  whole  coding  regions  and  much  lower  in  the  3’- 
untranslated  region.  Several  conserved  regions  localized  within  3’-untranslated 
sequences  of  ingl  possibly  map  some  functional  domains  within  ingl  mRNA  that  could 
be  involved  in  regulation  of  the  mRNA  transport,  translation  or  stability. 

Ingl  is  expressed  in  adult  mouse  tissues  at  moderate  levels,  with  the  highest 
expression  in  the  thymus  and  testis.  Two  isoforms  are  ubiquitously  expressed  with  the 
highest  levels  in  the  thymus,  while  the  third  is  shown  to  be  testis  specific.  Ingl  is 
expressed  during  the  entire  embryogenesis  with  the  highest  levels  in  the  yolk  sac  of  the 
early  embryo  and  inner  compartments  of  bones  of  the  later  stage  embryos.  Analysis  of 
the  cell  lines  showed  that  ingl  expression  correlates  with  cell  proliferation  suggesting  a 
connection  between  cell  growth  regulation  and  ingl  expression.  Differences  between 
expression  patterns  at  mRNA  and  protein  levels  indicate  possible  regulation  of  ingl  at 
the  levels  of  translation  or  protein  stability. 

The  work  concerning  structure  and  regulation  of  mouse  ingl  gene  is  submitted  to 
Molecular  and  Cellular  Biology,  and  I  am  sending  a  copy  of  the  manuscript  in  Apendix. 
The  manuscript  includes  detailed  explanation  about  how  this  work  is  done,  including 
experimental  methods,  results  and  discussion. 

In  order  to  make  ingl  deficient  animals  we  decided  to  delete  common  exon  of 
ingl  in  their  genome.  To  do  that,  we  applied  widely  used  technique  of  positive/negative 
selection  (11).  Targeting  vector,  containing  a  neomycin  resistance  ( neo )  gene  within 
homologous  regions  of  the  target  sequence  and  a  herpes  simplex  virus  thymidine  kinase 
(TK)  gene  outside  of  these  regions,  was  made  and  introduced  into  embryonic  stem  (ES) 
cells.  If  site-specific  recombination  occurs  in  cells,  integration  of  neo  and  elimination  of 
TK  gene  will  be  observed.  Selection  of  transfected  ES  cells  to  both  G418  and  ganciclovir 
resulted  in  isolation  of  cell  population  enriched  with  desired  clones.  Legitimate 
homologous  recombination  in  these  selected  ES  cells  was  confirmed  by  Southern  blot 
hybridization,  which  revealed  four  colonies  with  heterozygous  deletion  within  ingl  gene. 
ES  cells  with  a  heterozygous  deletion  of  the  ingl  common  exon  are  currently  being  used 
for  the  generation  of  ingl  knockout  mice. 


CONCLUSIONS 

As  a  first  step  towards  the  better  understanding  of  the  mechanisms  of  function  of 
ingl  gene  as  well  as  its  role  in  carcinogenesis,  I  analyzed  its  structure  and  regulation.  I 
precisely  mapped  the  beginning  and  end  points  of  ingl  transcription  and  determined 
exon-intron  structure  of  the  gene.  The  mouse  ingl  gene  is  transcribed  from  three 
differently  regulated  promoters.  Three  alternative  transcripts  of  ingl  share  a  long 
common  region  encoded  by  a  common  exon  and  differ  in  their  5’-exon  sequences 
encoded  by  alternative  5’-exons.  Only  one  of  the  5’-exons  contains  protein-coding 
sequence  while  translation  of  two  other  transcripts  starts  from  the  initiator  codon  within 
the  common  region.  Three  alternative  transcripts  of  ingl  thus  encode  two  different 
proteins,  different  only  in  size.  Comparison  of  the  mouse  and  human  Ingl  sequences 
reveals  extremely  high  levels  of  homology. 
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The  results  of  expression  analysis  indicated  that  mouse  ingl  is  a  subject  of 
regulation  both  at  the  protein  and  mRNA  levels  and  that  different  isoforms  have  different 
expression  patterns.  Expression  of  ingl  mRNA  correlates  with  cell  proliferation 
suggesting  a  connection  between  cell  growth  regulation  and  ingl  expression. 

First  step  in  the  generation  of  ingl  knockout  mice,  preparation  of  ES  cells  with  a 
heterozygous  deletion  of  the  ingl  common  exon,  is  finished.  Completion  of  this  project 
should  give  valuable  information  regarding  the  possible  role  of  this  gene  in 
carcinogenesis. 
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I  am  including  in  my  report  manuscript  of  the  paper  submitted  to  Molecular  and 
Cellular  Biology,  which  is  explaining  in  details  results  of  the  study  addressing  questions 
of  ingl  structure  and  regulation. 
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ABSTRACT 


Structure  and  expression  of  the  mouse  homologue  for  the  human  ING1  candidate 
tumor  suppressor  gene  were  analyzed.  Mouse  ingl  is  transcribed  from  three 
differently  regulated  promoters  localized  within  a  4  kb  region  of  genomic  DNA. 
Each  of  the  resulting  transcripts  represent  two  exons;  they  share  a  long  common 
region  encoded  by  a  common  exon  and  differ  in  their  5’ -exon  sequences  encoded 
by  alternative  5’ -exons.  Only  one  of  the  5’-exons  contains  protein-coding 
sequence;  translation  of  two  other  transcripts  starts  from  the  initiator  codon 
within  the  common  region.  Three  alternative  transcripts  of  ingl  thus  encode  two 
proteins  (p37,n91  and  p31'ng1),  one  of  which  is  a  truncated  version  of  the  other 
lacking  104  N-terminal  amino  acids.  Protein  sequences  encoded  by  mouse  and 
human  ingl  are  highly  conserved,  especially  in  their  C-terminal  regions  that 
contain  a  PHD  finger  domain.  p33ING1  belongs  to  a  small  family  of  proteins  from 
humans  and  yeast  that  have  approximately  the  same  size,  show  highly  significant 
similarity  to  one  another  within  the  PHD  finger  domain  and  also  contain  an 
additional,  N-terminal  region  with  subtle  but  reliably  detectable  sequence 
conservation.  Expression  of  ingl  mRNA  correlates  with  cell  proliferation 
suggesting  a  connection  between  cell  growth  regulation  and  ingl  expression. 
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The  ING1  candidate  tumor  suppressor  was  identified  as  a  result  of  a  functional 
screening  of  genes,  the  suppression  of  which  is  associated  with  neoplastic 
transformation  (5).  Inhibition  of  ING1  expression  by  antisense  RNA  promotes  anchorage 
independent  growth  in  mouse  breast  epithelial  cells,  increases  the  frequency  of  focus 
formation  in  NIH  3T3  cells,  and  prolongs  the  life  span  of  diploid  human  fibroblasts  in 
culture.  ING1  expression  is  upregulated  in  senescent  human  fibroblasts  (6),  and  ectopic 
expression  of  ING1  cDNA  leads  to  G1  arrest  or  promotes  apoptosis  in  several  cell  types 
(12).  The  accumulated  observations  indicating  ING1  participation  in  the  negative 
regulation  of  cell  proliferation,  control  of  cellular  aging,  and  apoptosis  have  defined  ING1 
as  a  candidate  tumor  suppressor  gene.  ING1  was  mapped  to  human  chromosome 
13q34  (7,  26),  and  loss  of  heterozygosity  in  this  chromosomal  locus  has  been  reported 
in  squamous  cell  carcinomas  of  the  head  and  neck  (13). 

We  have  recently  found  that  the  biological  effects  of  ING1  and  p53  are 
interrelated  and  require  the  activity  of  both  genes:  neither  of  the  two  genes  can,  on  its 
own,  cause  growth  inhibition  when  the  other  one  is  suppressed  (9).  Furthermore, 
activation  of  transcription  from  the  p21/WAF1  promoter,  a  key  mechanism  of  p53- 
mediated  growth  control,  depends  on  the  expression  of  ING1.  A  physical  association 
between  p33ING1  and  p53  proteins  is  detected  by  immunoprecipitation.  These  results 
indicated  that  p33ING1  is  a  component  of  the  p53  signaling  pathway,  and  that  p33ING1 
cooperates  with  p53  in  negative  regulation  of  cell  proliferation  by  modulating  p53- 
dependent  transcriptional  activation. 
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In  spite  of  the  apparent  importance  of  ING1  in  the  control  of  cell  proliferation,  our 
knowledge  of  expression,  regulation,  and  function  of  this  gene  remains  incomplete. 
Moreover,  GenBank  contains  two  versions  of  ING1  mRNA  sequences  differing  in  their 
5’-ends;  the  origin  of  these  differences  is  unknown  and  requires  explanation.  So  far,  all 
of  the  information  about  the  function  of  p33ING1  was  obtained  from  in  vitro  experiments 
that  involved  ectopic  expression  of  ING1  cDNA  or  its  suppression  by  antisense  RNA. 
The  analysis  of  the  structure  of  the  ING1  gene  and  its  regulation  in  vivo  is  an  essential 
step  towards  the  understanding  of  its  function  and  involvement  in  developmental  and 
physiological  processes.  This  is  particularly  important  due  to  the  cooperation  between 
ING1  and  p53,  which  suggests  that  the  functioning  of  the  p53  signaling  pathway  could 
be  dependent  on  the  regulation  of  ING1  expression. 

In  the  present  work,  we  used  the  mouse  orthologue  of  human  the  ING1  gene  for 
detailed  structural  and  expression  studies,  with  the  goal  of  subsequently  utilizing  it  as  a 
system  for  extensive  genetic  analysis.  Ingl  was  found  to  be  a  highly  evolutionarily 
conserved  gene  with  complex  regulation  which  involves  generation  of  alternative 
transcripts  initiated  from  different  promoters  and  translated  into  proteins  that  differ  in 
structure  and  expression  patterns. 

MATERIALS  AND  METHODS 

Animals 

Organs  and  embryos  of  FVB/N  mice  were  used  for  the  RNA  isolation  and  preparation  of 
whole-body  sections  of  embryos  using  cryostatic  microtome  for  histo-blot  hybridization. 
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Cell  lines 


Cultures  of  mouse  embryonic  fibroblasts  were  obtained  from  10  day-old  embryos.  10(1) 
cell  line,  a  derivative  of  Balb  3T3  cells  that  spontaneously  deleted  both  p53  alleles  (11), 
was  kindly  provided  by  Arnold  Levine.  Pseudonormal  mouse  mammary  gland  epithelial 
cell  line  (NMuMG)  was  obtained  from  the  ATCC  collection.  Ecotropic  retroviral 
packaging  cell  line  BOSC23  (16)  was  kindly  provided  by  Warren  Pear  and  David 
Baltimore  (Massachusetts  Institute  of  Technology).  All  cells  were  maintained  in 
Dulbecco’s  modified  Eagle  medium  containing  10%  fetal  bovine  serum  supplemented 
with  penicillin  and  streptomycin.  For  the  serum  starvation  experiments,  10(1)  and 
NMuMG  cells  were  kept  with  0.5%  fetal  bovine  serum  for  48  hours  (NMuMG  cells)  or  36 
hours  (10(1)  cells).  For  the  contact  inhibition  experiments,  cells  cultures  were  used  48 
hours  (NmuMG)  or  36  hours  (10(1))  after  they  became  confluent.  NMuMG  and  10(1) 
cells  were  irradiated  with  10  Gy  of  y-radiation  and  used  24  hours  after  treatment.  Mouse 
embryo  fibroblasts  were  propagated  until  they  reached  senescence. 


Plasmids  and  libraries 

Retroviral  vector  pLXSN,  used  for  the  introduction  of  mouse  and  human  ING1  cDNA  in 
NMuMG  and  10(1)  cells,  was  provided  by  A.  Dusty  Miller  (14).  A  cDNA  library  from 
senescent  mouse  embryonic  fibroblasts  was  constructed  using  Superscript  system 
(Gibco  BRL)  according  to  the  manufacturer’s  protocol,  and  cloned  into  the  lambda 
phage  X  gt22A  vector.  Stratagene’s  129SVJ  Mouse  Genomic  Library  cloned  into  the 
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Lambda  FIX  II  vector  was  used  for  the  isolation  of  the  mouse  genomic,  ingl  containing 
clones. 


Jk 


Hybridization  screening  of  cDNA  and  genomic  libraries 

cDNA  and  genomic  library  screenings  were  done  according  to  the  standard  protocols 
(18)  using  32P-labeled  human  and  mouse  ingl  cDNA  probes,  respectively. 


5’  and  3’  RACE 

Alternative  5’  ends  of  the  mouse  ingl  were  isolated  from  mouse  spleen  and  mouse  brain 
cDNAs  using  the  Marathon-ready  cDNA  kit  (CLONTECH  Laboratories),  according  to  the 
protocol  suggested  by  the  manufacturer.  API  adaptor-specific  sense  primer,  provided  by 
CLONTECH,  and  the  ing  •/-specific  antisense  primer  (5’ 

CC AT CT GACT CACG AT CT GGAT CTTC  -  3’)  were  used  for  PCR.  Nested  PCR  was 
performed  using  AP2  adaptor-specific  sense  primer,  provided  by  CLONTECH,  and  ingl- 
specific  antisense  primer  (5’  -  CTGCGGATCAGGGCCCTCTGGATGC  -  3’).  Precise 
determination  of  the  5’  and  3’  ends  of  the  mouse  ingl  transcripts  was  done  using  the 
Marathon-2  cDNA  Amplification  kit  based  on  the  new  SMART  PCR  cDNA  synthesis 
technology  (CLONTECH).  It  is  based  on  the  recently  identified  ability  of  M-MLV  reverse 
transcriptase  to  add  several  nucleotides  to  the  3’-terminus  of  first-strand  cDNA  during 
the  RT  reaction  (3).  Briefly,  when  reverse  transcriptase  reaches  the  5’end  of  the  mRNA, 
it  switches  templates  and  continues  synthesizing  the  SMART  template-switching  (TS) 
oligonucleotide.  The  resulting  single-stranded  cDNA  contains  the  complete  5’end  of  the 
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mRNA  as  well  as  the  sequence  complementary  to  the  TS  oligonucleotide  and  is  then 
selectively  amplified  by  PCR.  In  these  experiments,  poly(A)  RNA  preparations  isolated 
from  the  thymus  and  testis  were  used.  The  following  sequences  were  used  for  the 
syntheis  of  antisense  /ngf-specific  primers:  5’  -  AGGTGTGGTGGGATCGGCAACGC  -  3’ 
(for  is*oform  la),  5’  -  CGCGGGGAGCCAGAGCAGAGAAGGT  -  3’  (isoform  1c),  and  5’  - 
GGCGTGGCCTGTCATTGTCGCTG  -  3’  (isoform  1b).  Ing -/-specific  sense  primer  5’  - 
GCGTGCTTCTTGCTACCAT  -  3’  was  used  for  the  PCR  amplification  of  the  3’  end. 

Sequence  analysis 

Sequencing  was  done  using  a  Sequenase  Version  2.0  DNA  Sequencing  Kit  (USB),  or  by 
the  University  of  Chicago  Cancer  Research  Center  DNA  Sequencing  Facility.  In  all 
cases,  both  strands  were  read  using  multiple  vector-specific  or  gene-specific  primers. 
Protein  sequence  database  searches  were  performed  using  the  gapped  BLASTP 
program  and  the  PSI-BLAST  program  that  iterates  the  search  using  profiles  constructed 
from  BLAST  hits  as  queries  for  subsequent  iterations  (2).  Multiple  sequence  alignments 
were  constructed  using  the  Gibbs  sampling  option  of  the  MACAW  program  (15,  19). 

Southern,  northern,  and  western  analyses  were  done  according  to  standard  protocols. 
IgGI  mouse  monoclonal  antibody  against  human  recombinant  p33ING1  (8)  was  used  for 
the  detection  of  ingl -encoded  proteins. 
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In  situ  hybridization  (“histo-blotting”) 

Whole-body  sections,  20  pm  thick,  were  prepared  from  frozen  embryos  embedded  in 
blocks  of  Tissue-Tek  O.C.T.  and  stored  at  -70°C.  Sections  were  placed  on  nitrocellulose 
(Schleicher&Schuell)  to  prepare  “histo-blots”  as  previously  described  (10).  Histo-blots 
were  hybridized  with  33P-labeled  RNA  probes  synthesized  using  Ambion’s  MAXIscript  In 
vitro  Transcription  Kit.  Ingl  specific  sense  and  antisense  probes  were  synthesized  on 
the  pBLUESCRIPT  plasmid  with  the  fragment  of  ingl  cDNA  corresponding  to  the  PHD 
finger  domain  using  T3  polymerase  for  the  antisense  and  T7  polymerase  for  the  sense 
probe,  p-actin  antisense  RNA  was  synthesized  using  the  template  provided  by  Ambion. 
Histo-blots  were  incubated  in  pre-hybridization  solution  (0.75  mg/ml  yeast  tRNA,  0.75 
mg/ml  poly(A),  50%  formamide,  0.3M  Tris  pH  8.0,  ImM  EDTA,  5x  Denhardt’s  solution, 
10%  Dextran  sulfate,  10mM  DTT)  at  42°C  for  1-4  hours.  Hybridization  was  carried  out 
for  12-24  hours  at  42°C  in  the  same  solution  containing  1-5x1 07cpm  of  probe  per  ml. 
After  hybridization,  histoblots  were  washed  in  4x  SSC  at  room  temperature  for  20  min,  in 
2x  SSC,  0.04  pg/ml  RNase  A  at  37°C  for  30  min,  in  2x  SSC  at  37°C  for  30  min,  in  lx 
SSC  at  60°C  for  15-30  min  and  finally  in  0.1  x  SSC  at  60°C  for  15-30  min.  and  exposed 
to  X-ray  film  for  2-7  days. 

RESULTS 

Multiple  transcripts  of  the  mouse  ingl  gene  differ  in  their  5’ -end  sequences 

In  order  to  isolate  the  mouse  orthologue  of  the  ING1  gene,  we  screened  a  cDNA 
library  prepared  from  senescent  mouse  embryonic  fibroblasts,  using  human  ING1  as  a 
probe.  The  choice  of  the  library  was  determined  by  the  fact  that  ING1  is  expressed  at 
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higher  levels  in  senescent  compared  to  normal  human  fibroblasts  (6).  As  a  result, 
several  clones  were  isolated,  and  the  longest  two  were  sequenced.  The  clones  were 
identical  to  each  other  and  highly  similar  to  human  ING1  through  most  of  their  length 
except  for  the  5’  ends  which  were  different  and  not  homologous  to  the  human  gene 
(Figure  1).  This  observation  could  be  an  indication  of  alternative  splicing  of  mouse  ingl, 
but  it  could  also  be  potentially  explained  by  cloning  artifacts  occurring  during  the  cDNA 
library  preparation.  To  determine  the  structure  of  5’-end  sequences  of  ingl,  we  used  a 
5’-RACE  technique  for  the  isolation  of  cDNA  sequences  corresponding  to  the  5’-termini 
of  ingl  mRNA.  cDNA  was  synthesized  from  mouse  spleen  and  brain  mRNA,  and  ligated 
to  synthetic  adaptors.  PCR  was  performed  on  the  cDNA  using  a  sense  primer  specific 
for  the  adaptor  and  an  antisense  primer  specific  for  the  common  part  of  ingl  located 
close  to  the  divergent  region.  Two  fragments  of  different  size  were  obtained  and  cloned, 
sequenced,  and  compared  to  the  clones  isolated  from  the  cDNA  library.  The  results  of 
this  comparison  are  schematically  presented  in  Figure  1.  One  of  the  5’-RACE  products 
was  identical  to  one  of  the  previously  isolated  cDNA  clones.  Another  product  revealed 
the  third  variant  of  ingl  cDNA,  which  again  consisted  of  common  and  unique  regions 
with  the  junction  located  exactly  at  the  same  position  as  in  the  other  sequences.  Figure 
1  shows  schematic  alignments  of  the  isolated  mouse  ingl  cDNA  clones  and  the  5’ 
RACE  products.  All  variants  are  identical  to  each  other  (and  homologous  to  human 
ING1)  up  to  the  same  nucleotide,  and  different  only  in  their  5’  ends. 

In  order  to  verify  that  the  obtained  cDNA  structure  reflects  naturally  existing 
transcripts,  we  analyzed  ingl  mRNA  species  by  Northern  hybridization  using  probes 
representing  common  or  specific  regions  of  isolated  cDNAs.  As  shown  in  Figure  2,  the 
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probe  for  the  common  region  revealed  multiple  transcripts  in  RNA  isolated  from  mouse 
liver,  heart,  and  testis.  Probes  specific  for  individual  transcripts,  however,  showed  more 
simple  hybridization  patterns,  which  in  combination  covered  the  whole  set  of  transcripts 
found  by  hybridization  with  the  common  probe.  These  results  indicated  that  the  cloned 
sequences  represent  the  majority  of  multiple  transcripts  of  ingl  synthesized  in  the 
thymus  and  spleen. 

Mapping  of  coding  regions  of  ingl  in  mouse  genomic  DNA 

Comparison  of  sequences  of  individual  cDNA  clones  showed  differences  in  their 
5’-regions  suggesting  that  ingl  has  multiple  alternative  5’-exons  (Fig.  1).  To  verify  this 
hypothesis,  we  determined  the  structure  of  the  mouse  ingl  gene.  We  isolated  phage 
clones  carrying  sequences  homologous  to  the  mouse  ingl  cDNA  by  hybridization 
screening  of  a  mouse  genomic  library.  These  clones  were  mapped  by  restriction 
digestion  analysis,  in  combination  with  Southern  blot  hybridization,  with  the  probes 
corresponding  to  the  different  ingl  parts.  The  interpretation  of  the  obtained  results  is 
shown  in  Figure  3. 

Southern  blot  hybridization  analysis  of  ingl -related  sequences  in  the  mouse 
genome  indicated  that  ingl  is  a  single  gene  with  no  obvious  close  family  members. 
Comparison  of  genomic  and  cDNA  sequences  of  ingl  using  PCR  with  different  ingl 
specific  primers  (data  not  shown),  as  well  as  with  the  sequencing  data,  showed  that 
most  of  the  transcribed  sequences  of  the  ingl  gene  come  from  a  single  exon.  Alternative 
5’  ends  are  encoded  by  different  exons  positioned  upstream  from  the  common  exon. 
Comparison  of  the  sequences  of  cDNA  and  genomic  clones,  as  well  as  5’  RACE 
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products,  revealed  three  isoforms  of  mouse  ingl  that  differ  from  each  other  only  in  their 
5’  ends,  which  also  indicates  that  each  isoform  is  most  probably  expressed  from  its  own 
promoter. 

The  length  of  the  isolated  cDNA  clones  appeared  to  be  significantly  shorter  than 
that  of  the  mRNA  species  detected  by  Northern  hybridization  (Fig.  2),  suggesting  that 
part  of  the  transcribed  sequences  were  missing  from  the  isolated  cDNAs.  To  determine 
the  exact  start  sites  of  ingl  transcription,  we  used  a  new  procedure  called  “SMART- 
based  5’RACE”  as  described  in  Materials  and  Methods.  Using  this  method,  we  were  able 
to  extend  the  cDNA  sequences  for  the  5’end  of  isoform  1b.  This  exon  contains  an 
extremely  G/C  rich  region  that  probably  resulted  in  the  incomplete  sequence  obtained 
from  the  original  5’-RACE  (Figure  4). 

Comparing  the  3’ends  of  human  and  mouse  ING1  cDNAs  showed  that  although 
the  mouse  transcript  was  flanked  by  a  poly(A)  stretch,  its  3’-untranslated  region  was 
significantly  shorter  than  the  human  one.  Analysis  of  the  genomic  clone  with  the  mouse 
ingl  sequence  revealed  the  presence  of  a  long  poly(A)  stretch  that  could  potentially  be 
used  as  a  primer-binding  sequence  for  reverse  transcription  initiated  from  oligo(dT) 
primers.  Moreover,  the  alignment  with  the  3’  end  of  the  human  ING1  cDNA  sequence 
resumes  downstream  from  this  genomic  polyA  stretch.  The  exact  end  of  ingl 
transcription  in  mouse  was  determined  using  the  same  method  applied  for  the 
generation  of  5’-sequences.  Mouse  ingl  cDNA  was  in  fact  found  to  be  longer  than  was 
originally  thought  (Figure  4).  Sequence  comparison  showed  that  mouse  and  human 
ING1  transcripts  terminate  at  the  same  point,  and  share  significant  levels  of  similarity  up 
to  the  very  3’  ends. 
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Sequence  analysis  of  ingl 

Nucleotide  and  predicted  amino  acid  sequences  of  the  three  mouse  ingl  cDNA 
isoforms  and  the  promoter  regions  for  all  three  alternative  transcripts  are  shown  in 
Figures  5  and  6.  It  was  found  that  the  la  and  1c  ingl  isoforms  have  several  tightly 
clustered  transcription  start  sites.  Sequences  upstream  of  the  initiation  sites  (putative 
promoters)  lack  TATA  boxes,  but  they  do  contain  a  sequence  corresponding  to  the  loose 
initiator  (Inr)  consensus  PyPyANT/APyPy  that  includes  the  transcription  start  sites 
(Figure  5).  In  both  cases,  areas  upstream  from  the  Inr  are  extremely  GC-rich  with 
multiple  Spl -binding  sites.  All  of  these  features  are  typical  for  TATA-less  promoters  (21, 
22).  Binding  sites  of  inducible  factors  that  are  usually  present  further  upstream  in  the 
promoter  area  (e.g.  CAAT  box,  Oct-1  and  Oct-2  binding  site,  NFkB  or  ATF  binding  site) 
were  not  found  in  these  promoters.  Transcription  of  the  ingl  isoform  1b  starts 
approximately  30  nucleotides  downstream  from  the  TATA-like  box.  A  putative  CAAT  box 
is  present  60  nucleotides  upstream  from  the  TATA-like  sequence.  Since  the  transcription 
initiation  sites  for  the  la  and  1b  isoforms  are  only  about  200  bp  apart,  there  is  a 
possibility  that  their  promoters  share  some  of  the  regulatory  sequences  including,  for 
example,  a  number  of  Spl  binding  sites  located  upstream  from  this  area. 

All  three  alternative  transcripts  of  mouse  ingl  contain  the  same  long  open  reading 
frame,  although  the  sizes  of  the  predicted  proteins  are  different.  While  isoform  1b 
encodes  a  protein  of  279  amino  acids,  the  other  two  isoforms  are  predicted  to  encode  a 
shorter  protein  product  of  175  amino  acids,  which  lacks  104  N-terminal  amino  acids. 
Translation  of  these  two  products  is  expected  to  start  from  the  initiation  codon,  which  is 
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located  in  the  beginning  of  the  common  exon.  Alignment  of  the  predicted  amino  acid 
sequences  with  that  of  human  p33ING1  (GenBank  accession  number  AF044076) 
revealed  high  similarity  between  the  mouse  and  human  proteins  (89%  sequence 
identity)  (Fig.  7a).  At  the  nucleotide  level,  sequence  similarity  between  the  human  and 
mouse  cDNAs  was  high  throughout  the  whole  coding  regions  and  much  lower  in  the  3’- 
untranslated  region  that  contained,  however,  several  conserved  stretches  of  nucleotides 
(Fig.  7b). 

The  non-redundant  protein  sequence  database  at  NCBI  was  searched  using  the 
ingl-e ncoded  protein  sequence  as  a  query  and  a  highly  significant  sequence  similarity 
was  detected  with  a  human  paralogue  of  p33ING1,  three  uncharacterized  proteins  from 
the  budding  yeast  Saccharomyces  cerevisiae\  and  their  homologue  from  fission  yeast 
Schizosaccharomyces  pombe  (probability  of  the  similarity  being  observed  by  chance 
<10'12).  All  of  these  proteins  have  approximately  the  same  size  and  contain  a  C-terminal 
PFID  finger  domain  (1,  20).  The  sequence  conservation  in  the  PFID  domain  in  these  six 
protein  sequences  is  striking,  with  13  invariant  residues,  in  addition  to  the  8  metal¬ 
chelating  cysteines  and  histidines  that  are  conserved  in  all  PFID  fingers  (Fig.  8).  Multiple 
alignment  analysis  resulted  in  the  delineation  of  an  additional  N-terminal  region  that  is 
conserved  in  these  proteins  (Fig.  8).  This  region  consists  of  approximately  100  amino 
acid  residues,  includes  two  distinct  conserved  motifs  (Fig.  8),  and  shows  a  fairly  subtle 
similarity  that  was  not  statistically  significant  in  the  context  of  the  screening  of  the 
complete  database,  except  for  the  conservation  between  the  two  human  paralogs. 
Flowever,  in  the  reduced  search  space  defined  by  the  presence  of  the  PFID  finger,  it  was 
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shown  that  the  probability  of  finding  this  level  of  similarity  by  chance  was  below  10'13  for 
the  distal  motif  and  below  10'5  for  the  proximal  motif. 

Expression  of  ingl  transcripts  in  adult  and  embryonic  mouse  tissues 

We  analyzed  the  expression  of  ingl  mRNA  in  the  organs  of  adult  mice  and 
mouse  embryos  at  different  stages  of  development  by  Northern  blot  hybridization.  The 
probe  representing  the  common  exon  of  the  ingl  gene  revealed  multiple  mRNA  size 
classes  that  represent  alternative  transcripts  of  ingl  (Figure  9).  The  overall  ingl  mRNA 
expression  is  most  abundant  in  thymus  and  testis;  much  lower  levels  were  detected  in 
the  rest  of  the  tissues  tested  that  also  differed  in  the  content  and  relative  intensity  of  the 
hybridizing  bands.  The  same  pattern  of  ingl  mRNA  expression  was  observed  in  p53- 
deficient  mice  showing  no  direct  effect  of  p53  on  ingl  regulation  (data  not  shown).  In 
embryos,  the  highest  expression  was  found  on  the  11th  day  of  development  and  was 
characterized  by  changes  in  the  relative  expression  of  different  classes  of  mRNA 
transcripts. 

Patterns  of  ingl  expression  in  embryogenesis  were  also  checked  by  in  situ 
hybridization.  We  used  the  histoblotting  technique,  in  which  embryonic  sections  were 
fixed  on  a  nitrocellulose  membrane,  and  then  hybridized  with  the  RNA  probes 
corresponding  to  the  common  part  of  the  ingl  cDNA.  Besides  sense  and  antisense  ingl 
specific  RNA  probes,  we  also  used  an  antisense  probe  for  p-actin,  a  gene  with 
ubiquitous  expression,  as  an  internal  standard.  Sections  were  prepared  from  10-,  12-, 
16-  and  18-day  embryos.  Results  of  in  situ  hybridization  are  shown  in  Figure  10.  Ingl  is 
uniformly  expressed  in  the  whole  mouse  embryo  at  all  stages  of  development  examined. 


21 


This  is  in  agreement  with  the  result  obtained  with  the  adult  mouse  tissues,  where  ingl  is 
expressed  ubiquitously  and  at  similar  levels  in  all  analyzed  organs.  However,  at 
embryonic  day  10  higher  expression  levels  were  observed  in  the  yolk  sac,  while  at  day 
16  and  18  of  development,  higher  levels  of  expression  were  detected  in  inner 
compartments  of  bones  and  probably  match  areas  of  ongoing  ossification. 

In  order  to  determine  patterns  of  expression  of  different  ingl  transcripts  in  mouse 
tissues  and  during  embryogenesis,  Northern  blots  were  hybridized  with  probes 
corresponding  to  the  alternative  starts  of  ingl  (Figure  9).  The  1b  and  1c  variants  were 
expressed  ubiquitously  in  all  adult  mouse  tissues  tested,  with  the  highest  levels  in 
thymus.  Both  variants  were  expressed  in  the  embryos  at  all  stages  of  development 
analyzed,  with  the  highest  levels  at  day  7  in  the  case  of  1c,  or  day  1 1  in  the  case  of  the 
1  b  variant.  Out  of  all  adult  tissues  analyzed,  mRNA  for  isoform  la  was  expressed  only  in 
testis;  it  was  also  seen  in  the  1 1-day  embryo.  While  there  is  no  detectable  signal  with  the 
la  specific  probe  in  the  7  day  embryo,  traces  of  hybridization  could  be  detected  in 
mRNA  from  day  15  and  day  17  embryos,  which  indicates  an  extremely  low  expression 
level  at  these  points  of  mouse  embryogenesis. 

Expression  of  the  mouse  ingl  gene  in  cell  lines  depends  on  cell  proliferation 

To  check  whether  ingl  expression  was  cell  cycle  dependent,  we  analyzed  RNA 
from  two  mouse  cell  lines,  NMuMG  and  10(1),  at  different  stages  of  the  cell  cycle  by 
Northern  blot  hybridization  using  total  ingl  cDNA  as  a  probe.  Ingl  expression  was  also 
compared  in  senescent  versus  young,  dividing  mouse  embryonic  fibroblasts.  Results  are 
shown  in  Figure  11.  In  both  cell  lines,  ingl  was  expressed  at  higher  levels  in  dividing 


22 


compared  to  quiescent  cells  (quiescence  is  induced  either  by  serum  starvation,  contact 
inhibition,  or  y-irradiation).  However,  in  10(1)  cells  the  difference  in  expression  was 
specific  only  for  the  upper  ingl  specific  band,  which  corresponds  to  the  1b  variant.  In 
mouse  embryonic  fibroblasts  ingl  was  expressed  at  very  low  levels  without  any 
differences  between  dividing  and  senescent  cells. 

Ingl  expression  in  cell  lines  was  also  analyzed  by  Western  blotting  using  an  IgGI 
mouse  monoclonal  antibody  produced  against  human  recombinant  p33ING1  (8).  This 
antibody  has  previously  been  shown  to  specifically  detect  the  denatured  form  of  mouse 
p33ING1  protein  in  Western  immunoblot  protocols.  Cell  lysates  were  produced  from 
dividing,  contact  inhibited,  serum  starved,  or  y-irradiated  10(1)  and  NMuMG  cells.  Also, 
as  a  control,  cell  lysates  were  prepared  from  BOSC23  cells  transfected  with  mouse  the 
1b  ingl  variant  or  with  the  human  truncated  form  of  ING1  that  should  give  the  protein 
product  of  the  same  size  as  mouse  1c  and  la  variants,  respectively  (Figure  12).  In  both 
cell  lines,  two  protein  products  were  detected,  31  kD  and  38kD  in  size,  which  correspond 
to  the  truncated  ING1  and  1b  ingl  protein  products  respectively.  The  38kD  protein  was 
present  at  the  same  levels  in  dividing  and  quiescent  cells,  while  the  33kD  protein  was 
present  at  higher  levels  in  dividing  compared  to  non-dividing  cells.  The  variation  of 
expression  of  ingl  differs  significantly  at  RNA  and  protein  levels,  indicating  that  this 
gene  could  be  regulated  at  the  level  of  translation  or  protein  stability. 

DISCUSSION 

ING1  was  originally  described  as  a  gene  whose  suppression  promotes  neoplastic 
transformation  (5).  Consistent  with  that,  ectopic  expression  of  the  first  isolated  human 
cDNA  was  growth  suppressive  for  different  cell  lines.  ING1  was  found  to  encode  a 
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nuclear  protein  termed  p33ING1  that  was  shown  to  functionally  and  physically  interact 
with  p53  in  cell  growth  regulation  (7,  9).  Now,  after  we  have  characterized  the  structure 
and  expression  of  the  mouse  ingl  gene,  it  became  clear  that  ingl  regulation  is  more 
complicated  than  was  originally  thought.  Ingl  is  transcribed  from  at  least  three  differently 
regulated  promoters  and  the  resulting  transcripts  encode  at  least  two  different  proteins. 
All  the  transcripts  share  a  common  region  encoded  by  a  common  exon  but  differ  in  their 
5’-exons.  Two  of  these  alternative  exons  do  not  contain  protein-coding  sequences 
(isoforms  la  and  1c)  while  the  third  one  does  (isoform  1b).  Consistently,  one  of  the  ingl 
transcripts  encodes  a  37  kD  protein  (p37ING1)  while  two  others  are  translated  into  a 
shorter  protein  of  24  kD  that  surprisingly  runs  as  if  it  was  31  kD  (p31ING1).  Structures  of 
mouse  and  human  ingl  genes  are  likely  to  be  similar  considering  the  high  degree  of 
evolutionary  conservation  of  ingl  sequences  and  the  fact  that  the  two  versions  of  human 
ING1  that  are  currently  available  in  GenBank  also  share  a  large  common  part  but  have 
different  N-termini.  It  is  noteworthy  that  one  of  the  human  variants  of  ING1  (GenBank 
accession  number  AF001954)  does  not  have  a  homologue  with  similar  5’-sequences 
among  the  identified  mouse  ingl  isoforms  which  raises  the  possibility  that  there  could  be 
more  variants  of  mouse  ingl  transcripts  that  so  far  have  not  been  isolated. 

The  conserved  regions  in  the  3’-untranslated  sequences  of  ingl  (identified  by 
aligning  the  mouse  and  human  cDNAs)  may  potentially  be  involved  in  the  regulation  of 
mRNA  transport,  translation,  or  stability.  Remarkably,  one  of  these  regions  overlaps  with 
the  sequence  of  the  anti-/A/Gf  genetic  suppressor  element  (GSE)  that  is  known  to  be 
effective  against  both  human  and  mouse  genes  (5).  This  means  that  the  GSE  targets  a 
specific  region  of  the  mRNA  that  is  significantly  shorter  than  the  whole  GSE  sequence. 
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p33ING1  and  its  homologs  in  such  a  distantly  related  species  as  yeast  contains  a 
remarkably  conserved  PHD  finger  domain  and  an  additional,  weakly  conserved  domain 
of  unknown  function.  PHD  fingers  have  been  shown  to  bind  DNA  (20,  23)  whereas  to 
our  knowledge,  there  is  no  evidence  that  they  mediate  protein-protein  interactions. 
Therefore,  it  seems  likely  that  the  C-terminal  PHD  domain  in  the  p33ING  family  of 
proteins  is  involved  in  specific  DNA  binding  which  may  be  important  for  transcription 
regulation.  Given  the  outstanding  conservation  of  this  domain  across  the  large 
phylogenetic  distance  that  separates  humans  and  yeast,  the  specificity  of  DNA  binding 
with  respect  to  particular  binding  sites  is  expected  to  be  conserved  either  in  terms  of 
DNA  sequence,  distinct  features  of  chromatin  structure,  or  both.  It  may  be  further 
surmised  that  the  conserved  N-terminal  domain  is  involved  in  specific  protein-protein 
interactions  that  couple  transcription  regulation  by  p33ING1  and  its  homologs  to  other 
elements  of  cell  cycle  control.  The  conservation  of  domain  organization  in  p33ING1  and  its 
yeast  homologs  is  particularly  notable  given  that  yeast  does  not  encode  any  homologs 
of  p53.  Thus  it  appears  that  p33ING1  is  an  ancient  cell  cycle  regulator  whose  interaction 
with  p53  is  a  later  evolutionary  addition. 

The  results  of  expression  analysis  indicated  that  mouse  ingl  is  a  subject  of 
regulation  both  at  the  protein  and  mRNA  levels  and  that  different  isoforms  have  different 
expression  patterns.  Thus,  p37ING1  is  ubiquitously  expressed  in  all  tissues  analyzed  with 
elevated  mRNA  expression  levels  in  the  thymus,  while  levels  of  the  p31ING1  protein  vary 
dramatically  among  organs  and  cell  lines. 

Sizes  of  the  ingl  mRNA  transcripts  detected  by  Northern  hybridization  correlate 
well  with  the  length  of  cDNA  sequences  isolated  from  all  the  tissues  except  testis. 
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Probes  specific  for  each  of  the  identified  splice  variants  reveal  multiple  RNA  bands 
indicating  a  variability  of  the  ingl  transcripts  in  this  organ.  It  is  likely  that  in  testis  ingl 
mRNAs  are  either  terminated  or  processed  differently  from  the  rest  of  the  tissues. 

There  is  a  clear  link  between  the  proliferation  rate  of  the  cell  and  ingl  expression 
both  at  the  protein  and  mRNA  levels.  This  observation  may  reflect  cell  cycle 
dependence  of  ingl  transcription  detected  earlier  for  the  human  ING1  gene;  the 
biological  significance  of  this  regulation  is  not  obvious  so  far.  We  failed  to  observe  up- 
regulation  of  mouse  ingl  in  senescent  cells  as  it  has  been  reported  for  human  ING1  (6) 
which  suggests  that  regulation  patterns  of  mouse  and  human  ingl  genes  may  not 
always  be  the  same. 

The  complex  regulation  of  ingl  points  to  possible  functional  differences  between 
the  two  protein  isoforms.  In  fact,  a  similar  type  of  regulation  (alternative  initiation  leading 
to  variability  of  5’-exons)  found  in  other  tumor  suppressor  genes,  namely  BRCA1 ,  APC, 
and  INK4  (4,  17,  24,  25),  is  associated  with  the  generation  of  proteins  with  different 
functions  (in  the  case  of  INK4,  different  reading  frames  are  translated  in  the  alternative 
transcripts).  Experiments  on  functional  characterization  of  human  p38ING1  and  p31ING1 
showed  significant  differences  in  the  biological  activity  of  these  two  isoforms  of  the  ingl- 
encoded  protein  products  (Grigorian  et  al.,  in  preparation). 

One  of  the  key  questions  about  ING1  that  remains  unanswered  is  whether  the 
deregulation  of  this  gene  is  involved  in  naturally  occurring  cancer.  The  existence  of  at 
least  two  differentially  regulated  protein  products  encoded  by  ING1  with  potentially 
different  functions  clearly  means  that  any  attempts  to  investigate  the  potential  cancer 
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relevance  of  ING1  should  involve  separate  analysis  of  the  alternative  isoforms  of  this 
gene. 
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FIGURE  LEGENDS 


Figure  1.  Schematic  alignment  of  cDNA  sequences  isolated  by  the  cDNA  library 
screening  and  5’  RACE  with  human  ING1  cDNA.  A.  Mouse  ingl  cDNA,  isolated  from  the 
library  of  senescent  mouse  embryonic  fibroblasts,  aligned  with  human  ING1  cDNA. 
Open  reading  frame  for  the  human  ING1  clone  is  indicated.  B.  Alignment  of  mouse  and 
human  ING1  cDNAs  with  the  products  synthesized  by  5’-RACE.  The  position  of  the 
gene-specific  PCR  primers  used  for  the  RACE  reaction  is  indicated  in  panel  A. 

Figure  2.  Analysis  of  ingl  transcription  by  northern  hybridization.  Ingl  expression  in 
mouse  liver  heart  and  testis  was  analyzed  using  a  Multiple  Tissue  Northern  blot 
(CLONTECH  laboratories)  which  was  hybridized  with  the  probe  corresponding  to  the 
common  part  of  ingl  (1),  the  5’  end  of  isoform  la  (2),  the  5’  end  of  isoform  1b  (3)  or  the 
5’  end  of  isoform  1c  (4).  Probes  were  obtained  by  PCR  using  ingl -specific  primers. 
Arrow  shows  the  position  of  the  RNA  marker,  2.37kb  in  size. 

Figure  3.  Mapping  of  the  ingl  exons  in  the  mouse  genomic  region  containing  ingl 
cDNA-related  sequences.  Three  /ng  •/-positive  genomic  clones  were  isolated  from  the 
mouse  genomic  library  using  ingl  cDNA  as  a  probe.  To  determine  the  position  of  ingl 
exons,  DNA  was  cut  with  the  indicated  combination  of  restriction  enzymes  and  analyzed 
by  Southern  blot  hybridization  with  the  indicated  probes  representing  the  alternative  5’ 
ends  of  three  ingl  transcripts  (la,  1b  and  1c)  or  the  common  part  of  the  gene  (3’).  The 
results  of  hybridization  in  combination  with  the  PCR  data  and  with  partial  sequencing  of 
the  genomic  clones  allowed  us  to  determine  positions  of  the  ingl  exons. 
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Figure  4.  Determination  of  transcription  initiation  and  termination  sites  in  the  ingl  gene. 
The  upper  panel  shows  genomic  map  with  the  positions  of  the  ingl  exons,  and  the 
structure  of  ingl  cDNA  clones  as  determined  after  cDNA  library  screening  and  5’-RACE 
analysis.  The  structure  of  two  known  human  ING1  cDNA  variants  is  indicated  and  their 
relation  to  mouse  ingl  sequences.  The  lower  panel  demonstrates  the  the  final  structure 
of  the  ingl  gene  determined  as  a  result  of  precise  mapping  of  transcription  initiation  and 
termination  sites.  In  the  upper  panel,  the  position  of  the  G/C  rich  region  in  the  5’  end  of 
clone  1b  and  the  poly(A)  region  in  the  3’  part  of  the  ingl  transcripts  are  indicated,  which 
interfered  with  the  polymerase  reaction  during  the  original  5’-  RACE. 

Figure  5.  Sequences  of  the  putative  promoter  areas  of  the  ingl  gene.  A.  Sequence  of 
the  genomic  region  with  the  putative  promoter  areas  of  isoforms  1  a  and  1  b,  the  start 
sites  of  transcription  of  these  two  variants  are  shown  by  arrows.  The  initiator  sequence 
of  the  la  isoform,  including  multiple  initiation  start  sites,  is  underlined;  TATA-like 
sequence  for  the  1b  isoform  is  shown  in  bold.  Underlined  bold  sequence  is  CAAT  box 
positioned  about  lOObp  upstream  from  the  1b  transcript  initiation  site.  Sp-1  binding  sites 
are  displayed  in  bold  italic.  B.  Sequence  of  the  putative  promoter  area  of  the  1c  isoform, 
with  the  underlined  initiator  sequence  overlapping  the  two  transcription  start  sites. 
Multiple  sp-1  binding  sites  are  also  indicated. 

Figure  6.  Sequences  of  ingl  cDNAs  and  their  predicted  protein  products.  Sequences  of 
the  alternative  la,  1b,  and  1c  first  exons  as  well  as  the  common  ingl  exon  are  shown. 
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The  first  ATG  codon  of  isoform  1b  is  underlined,  as  well  as  the  ATG  codon  in  the 
common  exon  that  is  used  as  the  initiation  codon  for  la  and  1c  translation.  Stop  codons, 
indicating  the  end  of  translation,  are  also  underlined. 

Figure  7.  Comparison  of  mouse  and  human  Ingl  genes.  A.  Alignment  of  amino  acid 
sequences  of  mouse  ingl  1b  isoform  and  its  human  orthologue.  Numbers  of  amino 
acids  are  indicated;  numbers  in  parenthesis  indicate  the  size  of  the  truncated  protein 
product  of  la  and  1c  ingl  isoforms.  The  underlined  methionine  in  the  mouse  sequence 
indicates  the  beginning  of  the  protein  product  encoded  by  la  and  1c  isoforms.  Asterisks 
indicate  identical  amino  acids,  while  dots  indicate  conserved  changes  in  amino  acid 
sequence.  B.  Alignment  of  the  3’-untranslated  regions  of  human  and  mouse  Ingl 
mRNAs.  The  position  of  anti-INGI  GSE  sequence  (5)  is  underlined. 

Figure  8.  A  multiple  alignment  of  P33ING1,  its  yeast  homologs,  and  additional  PHD 
finger-containing  proteins.  The  aligned  conserved  blocks  are  separated  by  variable 
spacers  whose  length  is  indicated  by  numbers;  for  the  N-terminal  block,  the  distance  to 
the  protein  N-termini  is  indicated  (the  sequence  of  the  human  paralog  of  P33ING1  is 
incomplete).  Consensusl  shows  amino  acid  conservation  in  the  P33ING1  family  of 
proteins;  consensus2  shows  the  conservation  in  an  expanded  set  of  PHD  finger  proteins 
(in  addition  to  the  6  proteins  of  the  P33ING1  family,  the  sequences  that  aligned  with 
P33ING1  with  a  probability  of  a  random  match  below  10'4  in  the  first  iteration  of  the  PSI- 
BLAST  analysis  were  included).  Each  consensus  shows  amino  acid  residues  conserved 
in  all  sequences  of  the  respective  set;  h  indicates  a  hydrophobic  residue,  p  indicates  a 
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polar  residue,  indicates  a  negatively  charged  residue,  s  indicates  a  small  residue,  and 
a  indicates  an  aromatic  residue.  The  metal-chelating  residues  in  the  PHD  finger  domain 
are  shown  by  inverse  typing.  Each  protein  is  identified  by  a  gene  name  followed  by 
species  abbreviation  and  the  Gene  Identification  number  from  the  non-redundant  protein 
database  at  the  NCBI.  P33ING1p  is  the  human  paralog  of  P33ING1,  Mi-2  is  a  human 
autoantigen,  CHD3  is  a  chromatin-associated  helicase  of  the  SNF2  family;  the  remaining 
proteins  are  uncharacterized  gene  products  designated  by  their  systematic  gene  names. 
Species  abbreviations:  Hs,  Homo  sapiens,  Sc,  Saccharomyces  cerevisiae,  Sp, 
Schizosaccharomyces  pombe,  Ce,  Caenorhabditis  elegans. 

Figure  9.  ingl  expression  in  adult  mouse  tissues  and  embryos  analyzed  by  northern 
hybridizations.  A.  Hybridization  result  with  the  indicated  probes  with  total  mouse  RNAs 
isolated  from  the  indicated  organs;  photograph  of  ethidium  bromide  stained  gel  is  shown 
as  loading  control.  B.  Northern  hybridization  with  the  same  probes  of  RNA  on 
CLONTECH’s  mouse  embryonic  Multiple  Tissue  Northern  blot.  Here,  mRNA  was 
isolated  from  mouse  embryos  at  7,  11,  15,  and  17  days  of  development.  Arrows  indicate 
positions  of  the  1.35kb,  2.37kb,  and  4.4kb  RNA  markers  as  well  as  the  positions  of  28S 
and  1 8S  rRNAs. 

Figure  10.  ingl  expression  in  mouse  embryos  analyzed  by  in  situ  histo-blot 
hybridization.  Histo-blots  were  prepared  from  whole  body  sections  of  10-,  12-,  16-  and 
18-day  embryos,  and  hybridized  with  the  sense  and  antisense  ingl -specific  33P-labeled 
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RNA  probes  corresponding  to  the  PHD  domain.  Antisense  actin  probe  was  used  as  an 
internal  standard. 

Figure  11.  ingl  mRNA  expression  varies  depending  on  cell  growth  conditions.  RNA  was 
isolated  from  dividing  10(1)  (a)  and  NMuMG  (e)  cell  lines,  as  well  as  from  the  dividing, 
young  mouse  embryonic  fibroblasts  (h).  RNA  was  also  isolated  from  contact-inhibited 
10(1)  (b)  and  NMuMG  cells  (f);  serum  starved  10(1)  cells  (c)  (64  hours  at  0.5%  FBS);  y- 
irradiated  10(1)  cells  (d)  and  NMuMG  cells  (g);  and  senescent  MEFs  (i).  Filters  were 
probed  with  total  ingl  probe.  Bottom  panels  show  RNA  gels  from  which  northern  blots 
were  made.  Arrows  indicate  positions  of  the  2.37kb  RNA  marker  (upper  panels)  or  28S 
and  18S  rRNAs. 

Figure  12.  Expression  of  ingl  protein  products  at  depends  on  cell  growth  conditions. 
Cell  lysates  analyzed  by  western  with  antibodies  against  p33ING1  were  isolated  using 
RIPA  buffer  from  the  cells  treated  in  the  same  way  as  explained  in  Figure  1 1 . 
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CGATGG GGGC  GGAAGGCGCA  GGCGCGTGGA  GGCGCCGAGG  ATGCTGGGAG  TGGTGGTCCC 
TTTGCGTGTC  GATAGCTGCC  CTCCCGCGCG  GGTCCGCATC  CATCGCCAGG  GAGCTGCGCC 
m - 1 - *la 

TGCGCGTCCA  CTTCCGCCTC  GCTCGGCTCC  GCCCGCTCGT  CCCTCCTGGA  AGAGGACAGC 
TCTCTAGTAC  TAGGCGTGCG  GGGGGCGTTG  CCGATCCCAC  CACACCTCCT  TCTCGTCCAG 
ATTGGCCGCA  GAGGTTAGAG  TCGCAAGGGG  ATTGGCTGAC  GCCGTTGCCA  TTTCTGTTCG 

exon  - 1  intron  < - ►  lb 

GGGGCCCTAT  TTATACCGCG  CTAGGAGGCG  GGGGAATGCA  GTTAACCCAG  CGGAGGGTGG 
CTCTTCTCAG  GTTAGTCGGT  TTGAAGGCAG  CGATCTGGAG  GCTCGTCCGG  CCCCGGCGTG 
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Figure  8  (Zeremski  et  al.) 
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Fifure  9  (Zeremski  et  al.) 
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Figure  12  (Zeremski  et  al.) 


